Skip to content

fix restart epoch bug in trainer#72

Merged
michaelmckinsey1 merged 2 commits into
LBANN:mainfrom
PatrickRMiles:miles30/fix_restart_epoch
May 28, 2026
Merged

fix restart epoch bug in trainer#72
michaelmckinsey1 merged 2 commits into
LBANN:mainfrom
PatrickRMiles:miles30/fix_restart_epoch

Conversation

@PatrickRMiles

Copy link
Copy Markdown
Collaborator

Fixes a bug where training always starts at epoch 1, even when resuming from a checkpoint.

@michaelmckinsey1 michaelmckinsey1 self-requested a review May 28, 2026 17:03
@michaelmckinsey1 michaelmckinsey1 merged commit dee404c into LBANN:main May 28, 2026
1 check passed
michaelmckinsey1 pushed a commit to michaelmckinsey1/ScaFFold that referenced this pull request Jun 11, 2026
Co-authored-by: Patrick Miles <miles30@tioga.llnl.gov>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants